The Performance of Statistical Pattern Recognition Methods in High Dimensional Settings

نویسندگان

  • Stefan Aeberhard
  • Danny Coomans
  • Olivier de Vel
  • James Cook
چکیده

We report on an extensive simulation study comparing eight statistical classiication methods, focusing on problems where the number of observations is less than the number of variables. Using a wide range of artiicial and real data, two types of classiiers were contrasted; methods that classify using all variables, and methods that rst reduce the number of dimensions to two or three. The full feature space methods include linear, quadratic and regularized discriminant analysis, and the nearest neighbour method. The four dimensionality reducing classiiers are characterized by the transform they implement. The four transforms compared are the Fisher discriminant plane, the Fisher-Fukunaga-Koonz, the Fisher-radius, and the Fisher-variance transforms. The Fisher-Fukunaga and the Fisher-radius transform based classiiers have recently been proposed for two class classiication problems. We also present an extension to these transforms such that they can be applied to classiication problems with arbitrary numbers of classes. The simulations identiied reg-ularized discriminant analysis as the overall clearly most powerful classiier. The results show that in most cases, a reduction of the dimensionality to two or three dimensions prior to classiication decreases the probability of correctly allocating test observations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supervised Feature Extraction of Face Images for Improvement of Recognition Accuracy

Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...

متن کامل

Pattern Recognition in Control Chart Using Neural Network based on a New Statistical Feature

Today for the expedition of the identification and timely correction of process deviations, it is necessary to use advanced techniques to minimize the costs of production of defective products. In this way control charts as one of the important tools for the statistical process control in combination with modern tools such as artificial neural networks have been used. The artificial neural netw...

متن کامل

Steel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps

Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...

متن کامل

A New Statistical Approach for Recognizing and Classifying Patterns of Control Charts (RESEARCH NOTE)

Control chart pattern (CCP) recognition techniques are widely used to identify the potential process problems in modern industries. Recently, artificial neural network (ANN) –based techniques are very popular to recognize CCPs. However, finding the suitable architecture of an ANN-based CCP recognizer and its training process are time consuming and tedious. In addition, because of the black box ...

متن کامل

Principal Component Analysis for Data Compression and Face Recognition

Data compression is the most important step in many signal processing and pattern recognition applications. We come across very high dimensional data in such applications. Before processing of large-dimensional datasets, we need to reduce the dimensions to have lesser storage space and reduced computational complexities while retaining the maximum information. Principal Component Analysis (PCA)...

متن کامل

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994